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Abstract. Lasserre's hierarchy is a sequence of semidefinite relaxations for 
solving polynomial optimization problems globally. This paper studies the re- 
lationship between optimality conditions in nonlinear programming theory and 
finite convergence of Lasserre's hierarchy. Our main results are: i) Lasserre's 
hierarchy has finite convergence when the constraint qualification, strict com- 
plementarity and second order sufficiency conditions hold at every global min- 
imizcr, under the standard archimedean condition; the proof uses a result of 
Marshall on boundary hessian conditions, ii) These optimality conditions are 
all satisfied at every local minimizer if a finite set of polynomials, which are 
in the coefficients of input polynomials, do not vanish at the input data (i.e., 
they hold in a Zariski open set). This implies that, under archimedeanness, 
Lasserre's hierarchy has finite convergence generically. 



1. Introduction 



Given polynomials /, hi 



(1.1) 



Qj in x £ 



l n , consider the optimization problem 



mm 
s.t. 



hi(x) = (i = 1, . . . ,mi), 
9j(x) >Q(j = l,...,m 2 ). 

Let K be the feasible set of (jl.ip . When mi = (resp. mi — 0), there are no 
equality (resp. inequality) constraints. For convenience, denote h := (hi, ... , h mi ), 
g := (<7xj ■ • ■ ,9m 2 ) an d fJo '■= 1- A standard approach for solving (ll.l[) globally is 
Lasserre's hierarchy of semidefinite programming (SDP) relaxations [llj . It is based 
on a sequence of SOS type representations for polynomials that are nonnegative on 
K. To describe Lasserre's hierarchy, we first introduce some notation. Let R[x] be 
the ring of polynomials with real coefficients and in x :— (x±, . . . , x n ). A polynomial 



p e R[x] is said to be SOS if p 
SOS polynomials is denoted by 
integers), denote 



Pi 



(h) 



2 k 



mi 



i=l 



■ ■ + p\ for pi, . . . ,pk e R[x]. The set of all 
For each k £ N (N is the set of nonnegative 



each cj)i £ M[x] 
and deg((j>ihi) < 2k 



= (3) 



each a j G 



and deg(ajgj) < 2k 
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The set {h)2k is called the 2/c-th truncated ideal generated by h, and Qk{g) is called 
the fc-th truncated quadratic module generated by g. Lasserre's hierarchy is the 
sequence of SOS relaxations (fc G N is called a relaxation order): 

(1.2) max 7 s.t. / - 7 G (h) 2k + Qk{g)- 

The SOS program (| 1 . 2[) is equivalent to a semidefinite program QT|. We refer to 
[12l H4] for surveys in this area. 

Let f m in denote the minimum value of (jl.ljl and fk denote the optimal value 
of ()1.2p . Clearly, < / m j„ for all fc and {fk} is monotonically increasing. Under 
the archimedean condition (i.e., R — T,f =1 xf E (h)it + Qt(g) for some i £ N and 
R > 0), Lasserre obtained the asymptotic convergence fk — > fmin as k — > 00, by 
using Putinar's Positivstcllcnsatz (cf. Theorem 12. II) . When fk = f m i n for some k, 
we say Lasserre's hierarchy has finite convergence. When h(x) — defines a finite 
set in the complex space C™, Laurent (T3] proved that Lasserre's hierarchy has 
finite convergence. Indeed, when h(x) = defines a finite set in E™, the sequence 
{fk} also has finite convergence to f m in, as shown in [22 . There exist examples 
that Lasserre's hierarchy fails to have finite convergence, e.g., when / is the Motzkin 
polynomial x^x^ixl+x^ — SxD+x® and K is the unit ball [20l Example 5.3]. Indeed, 
such examples always exist when dim(K) > 3 (cf. Scheiderer [27l Prop. 6.1]). 

However, in practical applications, Lasserre's hierarchy often has finite conver- 
gence, e.g., as shown by numerical experiments in Henrion and Lasserre [51 UOj. 
The known examples for which finite convergence fails are created in very special 
ways. Since Lasserre proposed his method in [llj . people are intrigued very much 
by the discrepancy between its theory (only asymptotic convergence is guaranteed 
theoretically) and its practical performance (in applications we often observe finite 
convergence). The motivation of this paper is trying to resolve this discrepancy. 
Our main result is that Lasserre's hierarchy has finite convergence when a finite 
set of polynomials, which are in the coefficients of / and all hi,gj, do not van- 
ish at the input data, under the archimedean condition. This implies that, under 
archimedeanness, Lasserre's hierarchy has finite convergence generically. (We say a 
property holds generically if it holds in the entire space of input data except a set of 
Lebsgue measure zero.) To prove this, we need to investigate optimality conditions 
for (fLll . 

We here give a short review of optimality conditions in nonlinear programming 
theory (cf. [U Section 3.3]). Let u be a local minimizcr of (jl.ip and J(u) = 
{ji, . . . , j r } be the index set of active inequality constraints. If the constraint qual- 
ification condition (CQC) holds at u, i.e., the gradients 

V/11 (u), . . . , V/i OTl («), V# mi (it), . . . , Vg Jr (u) 

are linearly independent, then there exist Lagrange multipliers Ai,...,A mi and 
fii,...,fim a satisfying 

mi 7D.2 

(1.3) V/(u) = J^XiVhiiu) + Y^VjVgjiu), 

(1.4) MlSlO") = ••• = fJ,m 2 9m 2 ( u ) = °> Ml > 0, ...,/x m2 > 0. 

The equation (|1.3|) is called the first order optimality condition (FOOC), and (|1 ,4|) 
is called the complementarity condition. If it further holds that 

(1.5) [i\ + gi(u) > 0, . . . , n m2 + g m2 (u) > 0, 
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we say the strict complementarity condition (SCC) holds at u. Note that strict 
complementarity is equivalent to fij > for every j 6 J(u). Let L{x) be the 
associated Lagrange function 

1 a i 

L(x) := f(x) - \h,{x) - ^ ^j9j(x)- 
■i=i je.J{u) 

Clearly, (|1.3|) implies S7 x L(u) = 0. The polynomials f,hi,gj are infinitely many 
times differentiable everywhere. Thus, under the constraint qualification condition, 
the second order necessity condition (SONC) holds at u, i.e., 

(1.6) v T V 2 x L(u)v > for all v € G{u)^. 

Here, G{x) denotes the Jacobian of the active constraining polynomials 
G{x)=[Vh 1 {x) ••• Vh mi (x) Vg n {x) ■■• Vg Jr {x)] T 
and G(m) 1 " denotes the null space of G(u). If it holds that 

(1.7) v T V 2 x L(u)v > for all ^ v G G(w)~\ 

we say the second order sufficiency condition (SOSC) holds at u. 

We summarize the above as follows. If the constraint qualification condition 
holds at u, then (|1.3p . (|1.4[) and (|1.6j) are necessary conditions for u to be a local 
minimizer of / on K, but they are not sufficient. If (|1.3[) . (jl.4[) . (|1.5|) and (II. 7[) 
hold at a point it G X, then u is a strict local minimizer of (|1.1[) . The first 
order optimality, strict complementarity and second order sufficiency conditions 
are sufficient for strict local optimality. We refer to [U Section 3.3]. 

This paper studies the relationship between optimality conditions and finite con- 
vergence of Lasserre's hierarchy. Denote M.[x]d := {p G M.[x] : deg(p) < d} and 
[m] := {1, . . . , to}. Our main conclusions are the following two theorems. 

Theorem 1.1. Suppose the archimedean condition holds for the polynomial tuples 
h and g in If the constraint qualification, strict complementarity and second 

order sufficiency conditions hold at every global minimizer of 1 1.1]) . then Lasserre's 
hierarchy of il.ty) has finite convergence. 

Theorem 1.2. Let do, d\, . . . , d mi , d'i, . . . , d' m2 be positive integers. Then there 
exist a finite set of polynomials ipi,... ,ipL (cf. Condition \4-3\ ), which are in the co- 
efficients of polynomials f G H^a;]^, hi G K^a;]^ (i G [toi]/, gj G K[a^]d'. (j G [to^],), 
such that if <pi, . . . , (fL do not vanish at the input polynomial, then the constraint 
qualification, strict complementarity and second order sufficiency conditions hold 
at every local minimizer of il.ll) . 

The proof of Theorem 11.11 uses a result of Marshall on boundary hessian condi- 
tions [151117] . and the proof of Theorem ll.2l uses elimination theory in computational 
algebra. Theorem 11.21 implies that these classical optimality conditions hold in a 
Zariski open set in the space of input polynomials with given degrees. The paper is 
organized as follows. Section 2 presents some backgrounds in the field; Section 3 is 
mostly to prove Theorem ll.il Section 4 is mostly to prove Theorem II .2] Section 5 
makes some discussions. 
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2. Preliminary 

2.1. Notation. The symbol K (resp., C) denotes the set of real (resp., complex) 
numbers. A polynomial is called a form if it is homogeneous. For / G M[x], f de- 
notes the homogenization of/, i.e., f(x) = Zg 0g ^' ■ j '(x j 'xq) with x := (xq, X\, . . . , x n ) 
The symbol |j • H2 denotes the standard 2-norm. For a symmetric matrix X, X > 
(resp., X >- 0) means X is positive semidefinite (resp. positive dehnite). The 
determinant of a square matrix A is det A The N x N identity matrix is de- 
noted as In- K p is a polynomial in x, Vp (resp., V 2 p) denotes the gradient 
(resp., Hessian) of p with respect to x; if p has variables in addition to x, V x p 
(resp., V^p) denotes the gradient (resp., Hessian) of p with respect to x. For 
Pi, . . . ,p r G R[x], Jac(pi, . . . ,p r )\u denotes the Jacobian of (pi, . . . ,p r ) at u, i.e., 

Jac(j>i, . . -,Pr)\u = {dpi{u)/dXj)i<i<r,l<j<n- 

2.2. Some basics in real algebra. Here we give a short review on elementary 
real algebra. More details can be found in [3] . 

An ideal I of R[x] is a subset such that I ■ M[x] C I and I + I C I. Given 
Pi, . . . ,p m G M[x], (px, ■ ■ ■ ,Pm) denotes the smallest ideal containing all pj, which 
is the set p\ ■ K[s] + • • • + p m • R[x]. A variety is a subset of C™ that consists of 
common zeros of a set of polynomials. A real variety is the intersection of a variety 
and the real space R". Given a polynomial tuple p — (pi, . . . ,p r ), denote 

V(p) := {v G C" : Pl (v) =■■■= p r (v) - 0}, 

Vk(p) := {v G M" : pi(») = • • • = Pr(«) = 0}. 

Every set T C K n is contained in a real variety. The smallest one containing T is 
called the Zariski closure of T, and is denoted by Zar(T). In the Zariski topology 
on K™, the real varieties are closed sets, and the complements of real varieties are 
open sets. Denote I(T) := {q e R[x] : q(u) — OVu e T}, which is an ideal in R[x] 
and is called the vanishing ideal of T. 

Let ft = (hi, . . . , /i mi ) and 5 = (g\, . . . , g m2 ) be the polynomial tuples as in (jl.ll) . 
and K be the feasible set of (jl.ll) . Recall the definitions of (h)2k and Qk{g) in the 
Introduction. Clearly, the union UkeN(h)2k is the ideal (h) :— (hi, . . . , h mi ). The 
union Q(g) := UkefiQkig) is called the quadratic module generated by g. The set 
(h) + Q(g) is called archimedean if R — \\x\\\ G (h) + Q(g) for some R > 0. Clearly, 
if p G (/i) +<2(<7), then p is nonnegative on K, while the converse is not always true. 
However, if p is positive on K and (h) + Q(g) is archimedean, then p G (h) + Q(g). 
This is called Putinar's Positivstellensatz. 

Theorem 2.1 (Putinar, [23]). Let K be the feasible set of II 1.1)) . Suppose (h)+Q(g) 
is archimedean. If p G M[x] is positive on K , then p G (h) + Q(g). 

2.3. The boundary hessian condition. Let K be the feasible set of (|1.1|) and 

h = (hi, . . . , h rni ). Let u be a local minimizer of (II. ip . and £ be the local dimension 
of Vr(/i) at m (cf. [2, §2.8]). We first state a condition about parameterizing K 
around u locally, which was proposed by Marshall. 

Condition 2.2 (Marshall. |1 7]). i) The point u on Vr(/i) is nonsingular and there 
exists a neighborhood O of u such that Vr(/i) nO is parameterized by uniformizing 
parameters ti,...,tf, ii) there exist 1 < vi < ■ ■ ■ < v r < m,%, such that tj = g v . 
(j = 1, . . . ,r) on Vm.(h) n O and KC\0 is defined by ti > 0, . . . , t r > 0. 
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The following condition was introduced by Marshall [HI [17] in studying Puti- 
nar type representation for nonnegative polynomials, and it is called the boundary 
hessian condition (BHC). 

Condition 2.3 (Marshall. [15[ H7] ) . Assume Condition \2.2\ holds. Expand f locally 

around u as f = fo + fx + f% H , with every fi being homogeneous of degree i in 

ti, . . . , tg. The linear form fx — a\t\ + • • • + a r t r for some positive constants a\ > 
0, . . . , a r > 0, and the quadratic form /a(0, . . . , 0, t r +ii • • • > tl) * s positive definite in 
{t r+ \, . . . ,tg). 

If K is compact and the boundary hessian condition holds at every global min- 
imizer, then (11.1[) has finitely many global minimizers. (See the proof of Theo- 
rem 9.5.3 in [TB].) Marshall proved the following important result. 

Theorem 2.4. (Marshall, pH Theorem 9.5.3],) Let V = V R (h) and f min be the 
minimum of il.l]) . If (h) + Q{g) is archimedean and the boundary hessian condition 
holds at every global minimizer of then f — f m i n G I{V) + Q{q)- 

In the above, if I(V) = (h) (i.e., (h) is real, \2\ §4.1]), then / - f min e (h)+Q(g). 
Theorem 12.41 can also be found in Scheiderer's survey [2H1 Theorem 3.1.7]. 

2.4. Resultants and discriminants. Here, we review some basics of resultants 
and discriminants. We refer to [2J [TH [5S] for more details. 

Let /i, . . . , /„ be forms in x = (x±, . . . , x n ). The resultant Res{f\, . . . , /„) is a 
polynomial, in the coefficients of /i, . . . , /„, having the property that 

Res(f f n ) =0 ^ 30^eC fl ,/ 1 (u) = - = /» = 0. 

The discriminant of a form / is defined as 

A(/) := i?es — ,. 



k 9a;i ' dx r , 
So, it holds that 

A(/) = <s=^> 30^ue C", V/(w) = 0. 

Both Res(f±, . . . , /„) and A(/) are homogeneous, irreducible and have integer co- 
efficients. 

Discriminants and resultants are also defined for nonhomogeneous polynomials. 
If one of fo, fi, . . . , f n is not a form in x, then Res(fo, fi, . . . , /„) is defined to be 
i?es(/o, . . . , f n ), where each fi is the homogenization of fi. Similarly, if / is not a 
form, then A(/) is defined to be A(/). 

Discriminants are also defined for several polynomials [TO] . Let fi, ■ ■ ■ , f m be 
forms in x of degrees d\, . ■ ■ , d m respectively, and m < n — 1. Suppose at least one 
di > 1. The discriminant of f\,...,f m , denoted by A(/i, . . . , f m ), is a polynomial 
in the coefficients of fx,..., f m , having the property that A(/i, . . . , f m ) = if and 
only if there exists ^ m e C™ satisfying 

(2.1) fx{u) = --- = f m {u) = 0, rank[V/i(u) ••• Vf m (u)] < m. 
If one of fx, ■ ■ ■ , fm is nonhomogeneous and m < n, then A(/i, . . . , f m ) is defined 
to be A(/i, . . . , f m ). In the nonhomogeneous case, A(/i, . . . , f m ) = if there exists 
u G C" satisfying (J2T]) (cf. [T§]). 

We conclude this section with an elimination theorem for general homogeneous 
polynomial systems. 
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Theorem 2.5. (Elimination Theory, [7, Theorem 5.7A]) Let /i, . . . , f r be homoge- 
neous polynomials in xq , . . . , x n , having indeterminate coefficients aij . Then there 
is a set g\, . . . ,gt of polynomials in the aij , with integer coefficients, which are ho- 
mogeneous in the coefficients of each f t separately, with the following property: for 
any field k, and for any set of special values of the € k, a necessary and suf- 
ficient condition for the fi to have a common zero different from (0, . . . , 0) is that 
the aij are a common zero of the polynomials gj . 



3. Optimality conditions and Finite Convergence 
This section is to prove Theorem ll.il It is based on the following theorem. 

Theorem 3.1. Let u be a local minimizer of 11. If the constraint Qualification, 
strict complementarity and second order sufficiency conditions hold at u, then f 
satisfies the boundary hessian condition at u. 

Proof. Let J(u) := {ji, . . . ,j r } be the index set of inequality constraints that are 
active at u. For convenience, we can generally assume u = 0, up to a shifting. Since 
the constraint qualification condition holds at 0, the gradients 

V/ii (0), . . . , Vh mi (0), Vg h (0), . . . , Vg jr (0) 

are linearly independent. The origin is a nonsingular point of the real variety 
VR(h), because the gradients V/ii(0), . . . , V/i mi (0) are linearly independent. Up to 
a linear coordinate transformation, we can further assume that 



(3.1) 



[V 5jl (0) 
[Vfci(0) 



v. 9> ( 

V/i„,„ ( 



Let £ := n — mi, which is the local dimension of Vr(/ii, . . . , h mi ) at (cf. [U 
Prop. 3.3.10]). Define a function <p(x) :— (ipi(x), ipu(x), ipiu(x)) : R n — > M™ as 



(3.2) 



9h 0)' 



Sir ( X ). 



Cr+1 



Xf 



If I II (x) 



hi(x) 



h mi (x)) 



Clearly, </?(0) = 0, and the Jacobian of if at is the identity matrix /„. Thus, by the 
implicit function theorem, in a neighborhood O of 0, the equation t = if(x) defines 
a smooth function x — ip^ 1 (t). So, t = (ti, . . . , t n ) can serve as a coordinate system 
for W 1 around and t = <p(x). In the i-coordinate system and in the neighborhood 
O, Vr(/ii, . . . , h mi ) is defined by linear equations = • • • = t n = 0, and K n O 
can be equivalently described as 



h > 0,...,t r > 0, tt 



+i 



tn 



0. 



Let \i(i € [mi]) and £ [m 2 ]) be the Lagrange multipliers satisfying (ll.3l) - (|1.4l) . 
Define the Lagrange function 



L(x) := f(x) 



II 1 1 



Xihi{x) -^2^j k gj k ( x )- 
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Note that \7 X L(0) = 0. In the ^-coordinate system, define functions 

n r 

F(t) := /for 1 (*)), L(t) := L^ 1 (t)) = F(t) - £ A 4 _A - £ Hk t k . 

i=l+l k=l 

Clearly, V x L(0) = implies V t i(0) = 0. So, it holds that 

dF(0) 



dtk 

8F(0) 
dt k 
dF(Q) _ 
dt k ~ 



H jh (k = l,...,r), 

= Q(k = r + l,. ..,£), 
Xk-i {k = £ + 1,..., n). 



Expand F(t) locally around as 

F(t) = fo + f 1 (t) + f 3 (t) + f 3 (t) + "- 
where each fi is a form in t of degree i . Clearly, we have 

/i (t) — Mji ti H hMir*r on U+i = ■■■ =t n = 0. 

For . . . ,tg near zero, it holds that 

F(0, • . • , 0, t r+ i, . . . , t/, 0, . . . , 0) = L(0, . . . , 0, U+i, . . . , t t , 0, . . . , 0) 
L^-^O,... ,0,^ + i,. ..,^,0,...,0)). 
Denote x(t) := V? _1 (*) = (<^T (i),...,^ 1 ^))- For all we have 
d 2 L(t) _ 

Kk.s< 



dUdtj 



Kfe<r 



dUdtj 



Evaluating the above at x = t = 0, we get (note V x L(0) = 0) 



6> 2 £(0) 

dtidtj 



E 



a 2 L(o) a^o) a^-Ho) 



Kk,a<t 



dx k dx s dti 



dt, 



Note that Jac((p) |o = Jac(ip )|q = /«■ So, for all r + 1 < z, j < £, we have 
(3.3) 



a 2 / 2 




d 2 F 




d 2 L 




d 2 L 




dtidtj 


t=0 


dtidtj 


t=o 


dtidtj 


t=o 


dxidxj 


x=0 



The strict complementarity condition (|1.5j) implies that fij ± > 0, . . . , \ij T > 0. 
So, the coefficients of the linear form fi^ti + • • • + fij r t r are all positive. The second 
order sufficiency condition (| 1 . T[) implies that the sub-Hessian 

/5 2 L(0) 



\dxidxj j r+ i<jj<g 

is positive definite. By (|3.3j) . the quadratic form ji is positive definite in {t r +i, ■ ■ . , ti). 
Therefore, / satisfies the boundary hessian condition at 0. □ 



Now, we give the proof of Theorem 11.11 
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Proof of Theorem ll.il By Theorem 13.11 we know the boundary hessian condition 
is satisfied at every global minimizer of / on K, when the constraint qualification, 
strict complementarity and second order sufficiency conditions hold. Then, by 
Theorem 12.41 of Marshall, we know there exists o\ £ Q(g) such that 

/ - fmin = ci mod I(Vn(h)). 



Let / := / 
lensatz (cf. 



— fmin — Then / vanishes identically on V^(h). By Real Nullstel- 
[21 Corollary 4.1.8]), there exist £ €N and er 2 G £M[x] 2 such that 

f 2e + a 2 e(h). 



Let c > be big enough such that s(t) := 1+t-f 
in t (cf. [HJ Lemma 2.1]). For each e > 0, let 



ct is an SOS univariate polynomial 



a 2 + cr 1 . 



Then, one can verify that 

& ■= f - (/„ 



,l-2£/ ?2l 



(f M + a 2 )e(h). 



Clearly, there exists k G N such that a t G Qk (g) and 4> e G (ti)2k for all e > 0. 
So, for every e > 0, 7 = f m i n — e is feasible in (II. 2[) for the order /c . Hence, 
jfco > /mm - e- Since e > can be arbitrary, we get f ko > f rnin . Recall that 
fk < fmin for all k and {/&} is monotonically increasing. Hence, we get = / m ^ n 
for all k > k , i.e., Lasserre's hierarchy has finite convergence. □ 



Theorem 13 . 1 1 shows that the constraint qualification, strict complementarity and 
second order sufficiency conditions imply the boundary hessian condition. Typi- 
cally, to check the boundary hessian condition by its definition, one needs to con- 
struct a local paramctrization for the feasible set K and verify some sign condi- 
tions, which would be very inconvenient in applications. However, checking opti- 
mality conditions is generally much more convenient, because it does not need a 
parametrization and only requires some elementary linear algebra operations. This 
is an advantage of optimality conditions over the boundary hessian condition. We 
show this in the following example. 

Example 3.2. Consider the optimization problem: 

min x^ ~\~ x 2 ~\~ x^ ~f~ 3x \X 2 x^ x^^x^ ~\~ -£3) *^2(*^'3 *^i) ^3(^1 ^2) 



s.t. 



1. 



The objective is the Robinson form which is nonnegative but not SOS (cf. 24 ). 
The minimum f min = 0, and the global minimizers are 

i=(±l,±l,±l) ,-L(±l,±l,0) i(±l,0,±l),i=(0,±l,±l). 

The unit sphere is smooth, so the constraint qualification condition holds at every 
feasible point. There is no inequality constraint, so strict complementarity is au- 
tomatically satisfied. It can be verified that the second order sufficiency condition 
(jl.7p holds on all the global minimizers. For instance, at u = "^(L lj 1)) 



vim = 9 



/ 


"1 cf 




Y 




T 




T 


3 • 


1 




1 




1 


j, G(u)^ = 


1 


V 


1 




1 




1 




1 
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Clearly, (|1.7j) is satisfied at u. By Theorem ll.il Lasserre's hierarchy for this problem 
has finite convergence. A numerical experiment by GloptiPoly 3 [8] verified that 
fs = fmin = 0, modulo computer round-off errors. □ 

In Theorem ll.l[ none of the optimality conditions there can be dropped. We 
show counterexamples as follows. 

Example 3.3. (a) Consider the optimization problem: 
J min 3xi + 2x2 

\ s.t. x\ -x\ - [x\ + x 2 2 ) 2 > 0, xi > 0. 

It can be shown that the origin is the unique global minimizer. The constraint 
qualification condition fails at 0, and the first order optimality condition (|1.3[) 
fails. The feasible set has nonempty interior, so the SOS program (11.21) achieves its 
optimal value (cf. |11)V Lasserre's hierarchy for this problem does not have finite 
convergence, which is implied by Proposition 13.41 in the below. 

(b) Consider the optimization problem: 

J min X\X2 + x\ + x\ 

y S.t. X\ > 0, X2 > 0, 1 — X\ — X2 > 0. 

Clearly, is the unique global minimizer. The constraint qualification condition 
holds at 0. The Lagrange multipliers are all zeros. The second order sufficiency 
condition (|1.7p also holds at because the null space G(0) = {0}. However, the 
strict complementarity condition fails at 0. Lasserre's hierarchy for this problem, 
does not have finite convergence, as shown by Scheiderer [26j Remark 3.9]. 

(c) Consider the optimization problem: 

J min x\ x\ + x\x\ + x% — Zx\x\x\ + e(x\ + x\ + x%) 3 
\ s.t. 1 -x\ -x% -x\ > 0. 

For every e > 0, is the unique global minimizer, and the constraint qualification 
and strict complementarity conditions hold at 0. However, the second order suffi- 
ciency condition fails at 0. For e > sufficiently small, Lasserre's hierarchy for this 
optimization problem does not have finite convergence, as shown by Marshall [151 
Example 2.4]. □ 

The first order optimality condition (|1.3p is necessary for Lasserre's hierarchy to 
have finite convergence. This is summarized as follows. 

Proposition 3.4. Suppose achieves its optimal value. If the first order opti- 

mality condition il.3\) fails at a global minimizer of il.l]) . then Lasserre 's hierarchy 
cannot have finite convergence. 

Proof. Suppose otherwise fk — fmin for some k. Since (| 1 . 2[) achieves its optimum, 

mi 7722 

/ - fmin = ^l<Pihi + ^O-jgj 

i=l j=0 

for some <fii £ M.[x] and <Xj S SIR [a;] 2 . Let u be a global minimizer of (jl.ip . Note 
that every hi(u) = and gj(u)o~j{u) = 0. Differentiate the above with respect to x 
and evaluate it at u, then we get 

mi 7712 

v /(") = £&(«)Vfe(u) + £(ffjMVjj(u) + ffjHV^(u)). 
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Since every uj is SOS, gj(u)o~j(u) = implies gj(u)'Vcrj(u) = 0. Hence, 

mi 7712 

i=l j=0 

But this means that (II .3|) holds at u, which is a contradiction. So Lasserre's hier- 
archy cannot have finite convergence. □ 

In Proposition 13.41 the assumption that (|1.2I) achieves its optimal value cannot 
be dropped. (This assumption is satisfied if K has nonempty interior, cf. As 
a counterexample, consider the simple problem 

min x s.t. — x 2 > 0. 

The global minimizer is 0. The first order optimality condition fails at 0, but 
Lasserre's hierarchy has finite convergence (fk = fmin = for all k > 1). 

4. Zariski Openness of Optimality Conditions 

This section is mostly to prove Theroem 11.21 For this purpose, we need some 
results on generic properties of critical points. 

4.1. Generic properties of critical points. Given polynomials po € M[x]d , . . . ,pk € 

K[x]d fc with k < n, consider the optimization problem 

(4.1) min Po(x) s.t. Pi(x) = ■ ■ ■ = Pk{x) = 0. 

Its Karush-Kuhn- Tucker (KKT) system is defined by the equations 

fe 

(4.2) V x p (x) -^2\iV x pi(x) =0, px(x) = ■■■ =p k (x) =0. 

8=1 

Every (x, A) satisfying (|4.2[) is called a critical pair, and such x is called a critical 
point. Let 

rank [V x p (x) V x pi(x) ■■■ V x p k (x)]<k 
pi(x) = • • • = p k (x) = 

be the KKT variety of (|4.1j) . Clearly, every critical point belongs to JC(p). 

First, we discuss when does K(jp) intersect the variety q(x) = of a polynomial 
q £ M.[x]d k+1 , i-e., when does the polynomial system 

rank [V x p (x) V x pi{x) ■■■ V x p k {x)] < k 
p\(x) = ■■■ = pk(x) = 0, q(x) = 

have a solution in C n ? For a generic p, K,(p) is a finite set (cf. [TH Prop. 2.1]), and 
it does not intersect q(x) = if q is also generic. Consider the homogenization in 
x := (xi, . . . , x n ) of the polynomial system (|4.4p : 

rank[V x ^)(x) V x pi(x) ■■■ V x p k {x)]<k, 
pi(x) = • • • = pk(x) = q(x) = 0. 

Its variable is x := (xq, . . . , x n ). When k < n, the matrix in (|4.5p has rank < k if 
and only if all its maximal minors vanish; when k = n, the rank condition in (|4.5|) is 
always satisfied and can be dropped. Thus, in either case, (I4.5[) can be equivalently 
defined by some homogeneous polynomial equations, say, 

M x {x) = ■■■ = M e (x) =0. 



(4.3) K{p) :=\xe 



(4.4) 



(4.5) 
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Note that the coefficients of every Mi are also homogeneous in the ones of each of 
po, . . . ,Pk,q. By Theorem I2.5[ there exist polynomials 

Ri(p ,.. . ,Pk;q), ■ ■ -,Rt(po, ■ ■ ■ ,Pk;q) 
in the coefficients of po, . . . ,p k , q such that 

• every Ri(po, ■ ■ ■ ,Pk',q) has integer coefficients and is homogeneous in the 
coefficients of each of po,Pi, . . ■ ,Pk, q~, 

• the system (|4.5p has a solution 7^ x <E C™ +1 if and only if 

Rx(p ,...,p k ;q) =■■■ = R t (po, ■ ■ ■ ,Pk; q) = 0. 
Define the polynomial 3?(po, ■ ■ ■ ,Pk',q) as 

(4.6) &(po, ■ ■ ■ ,Pk;q) ■■= Ri(po, ■ ■ ■ ,Pk;q) 2 H 1- Rt(po, ■ ■ -,Pk] q) 2 - 

Note that M(j>q, . . . ,Pk', q) is a polynomial in the coefficients of the tuple 

(p , ...,p k ,q) & R[a;]d x • • • x R[x] dk x R[x] dk+1 . 
Combining the above, we can get the following proposition. 

Proposition 4.1. Let p £ R[x]d , ■ ■ ■ ,Pk £ ^[ x ]d k ,q £ ^[x]d k+1 , and 3$ be 
as defined in j4.6\ ). Then J^.5[ ) has a solution / 1 £ C™ +1 if and only if 
8£(pQ, . . ■ ,Pk', q) = 0. In particular, if 3#(po, . . . ,Pk',q) 7^ 0, then j4-4\ ) has no 
solution in C™. 

We would like to remark that the polynomial 3$ in (14.61) does not vanish iden- 
tically in (po, • ■ • ,Pk, q) € K[^]d n x ■ • • x M[x]d fc x R[x]d k+1 , for any given positive 
degrees do, . . . , dk, d k +i- A proof for this fact is given in the Appendix. 

Second, we discuss when the KKT system (|4.2j) is nonsingular. Denote 



L p (x, A) := p Q (x) - ^ ^iPi( x )- 



The polynomial system (|4.2[) is nonsingular if and only if the square matrix 

, ,s _ [ V 2 x L p (x,\) Jac( Pl ,...,p k )\T 
U r [X > A) - [jac( Pl ,..., Pk )\ x 

is nonsingular at every critical pair (2, A). If every pi is generic, there are only 
finitely many critical pairs, and (|4.2p is nonsingular if deti? p (x, A) does not vanish 
on them. 

The matrix H p {x, A) is singular if and only if there exists (0, 0) ^ (y, v) £ 1" x R fe 
such that 

k 

(4.7) V 2 x L p (x,\)y + Jac(p 1 ,...,p k )\lv = 0, y € f| V Pl {x)^. 



When Jac(pi, . . . ,pk)\x has full rank k, the existence of a pair (y, v) ^ (0, 0) satis- 
fying (|4.7j) is equivalent to the existence of a pair (y, v) with y 7^ satisfying (|4.7[) . 
When (|4.2I) is nonsingular, there is no y 7^ satisfying (|4.7p for any critical pair 
(x, A). Write v = (vi, . . . ,v k ), then (14.2[) and (14. 7|) together are equivalent to 



(4.8) 



(ylpo)y 



- EA 4 



VxPi 

iy 2 xPi)y 



k 
z=l 



= 0, 



Pk{x) = (VxPiYy 





v x pi 

\V x Pk) T y = 0. 
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Define the (2ra) x (2k + 1) matrix 

p / n _ T VxPo ■ ■ ■ ^xPk 
[X,V) [(V 2 xPo )y ■■■ (V 2 xPk )y \7 xPl 

Clearly, every pair (x, y) in (|4.8|) satisfies 



(4.9) 

If the vectors 



rankP(x,y) < 2k, pi(x) = ■ ■ ■ = p k (x) = 0, 
{^xPi) T y = ■ ■ ■ = (V xPk ) T y = 0. 



(i=l,...,k) 



^xPi 







(vlPi)y_ 


j 





arc linearly independent, (|4.8|) and (|4.9|) are equivalent. Consider the homogeniza- 
tion in x of ([43)1 : 

rank P(x,y) <2k, p\ (x ) = ■ ■ ■ = p k (x) = 0, 
(V,pi) T y = • • • = (V xPk ) T y = 0. 



(4.10) 

In the above, x := (xq 
P(x,y) := 



. . , a;„) and 



(V^Po)y 



^xPk 

(v 2 xPk)y VxPi 



^xPk 



When k = n, we always have rank P(x,y) < 2k and the rank condition in (|4. 10[) 
can be dropped. When k < n, we can replace rankP(i, y) < 2k by the vanishing of 
all maximal minors of P(x,y). In either case, (|4.10[) could be equivalently defined 
by some polynomial equations, say, 

N x {x,y) =■■■ = N r (x,y) = 0. 

Note that all Ni, . . . ,N r are homogeneous in both x and y, and their coefficients are 
also homogeneous in the ones of each of Po,pi, . . ■ ,Pk- By applying Theorem 12.51 
twice (first in x and then in y), there exist polynomials Di(po,pi, . . . ,pk) (i — 
1, . . . , s), in the coefficients of po,p\, . . . ,pk, such that 

• every Di(po,pi, ■ ■ ■ ,Pk) has integer coefficients and is homogeneous in the 
coefficients of each of po,pi, ■ ■ ■ ,p k ; 

• there exist O^ie C rl+1 and ^ y € C" satisfying (|4~TUf if and only if 

£>ibo,Pi, ■■ - ,Pk) = ■ ■ ■ = D s (p a ,p 1 , . . . ,p k ) = 0. 
Define the polynomial ^(pojPi, ■ ■ ■ ,Pk) as 

(4.11) @(p , Pl , • ■ • ,Pk) ■= D 1 (p Q ,p 1 , . . . ,p k ) 2 H h D s (p Q ,p 1 , . . . ,p k ) 2 . 

Note that @(po, ■ ■ ■ ,Pk) is a polynomial in the coefficients of the tuple 

(po,.. . ,p k ) e R[x] do x • • • x R[x]d„. 
Combining the above, we can get the following proposition. 

Proposition 4.2. Let po E R[x]d , ■ ■ ■ ,Pk € and & be as defined in l[4-ll\ )- 

Then \4- Mfy has a solution (x,y) E C™ +1 x C" with x, ^ 0, y ^ if and only if 
@(po, ■ . . ,Pk) = 0. In particular, if %l(po, • ■ • ,Pk) 7^ 0, then is a nonsingular 

system. 



The following special cases are useful to illustrate Proposition [O 
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• (Every deg(pi) = 1.) Let pi — afx + bi for i = 0, . . . , k. If k < n and 
do, oi, . . . , afe are linearly independent, then P is a constant matrix of rank 
2fc + 1. If k = n and a\, . . . , a n are linearly independent, then there is no 
y 7^ satisfying V x p^ V = for i = 1, . . . , n. So, if every is generic, then 
(|4.10j) has no complex solution (x, y) with x ^ 0, y 7^ 0. 

• (fc = 0, i.e., (|4.1[) has no constraints.) The system (|4.10l) is then reduced to 

(4.12) V xP ~ (£)=0, (Vlp~o(x)) y = 0. 

If deg(po) = 1 and po is nonzero, V;rPo(:r) = has no complex solution. 
If deg(po) = 2 and po — x 1 Ax + 2b T x + c with det(A) ^ 0, there is no 
y 7^ satisfying (\7 2 p (x)) y = 0. When deg(po) > 3, by the definition 
of discriminants for several polynomials (cf. §2.4), (|4.12|) has a complex 
solution (x, y) with x ^ 0, y ^ if and only if 



So, if po is generic, there are no x ^ 0, y 7^ satisfying (|4.12[) . 
The above observations can be simply implied by Proposition! 



In Proposition 14.21 one might naturally think of replacing @ by 
(4.13) A(V x p - Jac(p!,. . . ,p k )\l\,pi, ■ ■ . >Pk), 



which is the discriminant for the set of polynomials defining (|4.2j) . by considering 
Ai, . . . , Afe as new variables, in addition to x. However, this approach is problematic. 
The main issue is that the discriminantal polynomial in (|4. 13[) might be identically 
zero, e.g., when deg(po) < niaxi<i<fc deg(pi). For convenience, consider the simple 
case n > k — 1 and a := deg^) — deg(po) > 0. By definition of discriminants for 
several polynomials (cf. §2.4), the discriminant in (14. 13)) vanishes if there exists a 
complex vector (xq, x\, . . . , x n , Ai) 7^ satisfying 



(4.14) 



1 ■ V x p - AiV^i = 0, pi(x , . . . ,x n ) = 0, 
~x% +1 ■ W 2 x po - XiWln V x p{ 
V x pi T 



det 



= 0. 



Let (iti, . . . , u n ) 7^ be a complex zero of pi(Q, x\, . . . , x n ). Then, (0,tti, . . . , u n , 0) 
is a nonzero solution of (|444|) . So, for any po,Pi, <I4. 14|) always has a nonzero 
complex solution like (0, u±, . . . , u„, 0). This means that the discriminant in (|4.13[) 
identically vanishes. On the other hand, the polynomial & in (|4. 1 1[) does not 
vanish identically in (po, ■ ■ ■ ,Pk) G ^Hdo x ''' x ^Md,,, for any given positive 
degrees do, . . . ,dk- A proof for this fact is given in the Appendix. 

Typically, the polynomials M in (|4.6|) and @ in (|4. 1 1[) are very difficult to com- 
pute explicitly. They are mostly for theoretical interests. 



4.2. Zariski openness of optimality conditions. This section is to prove that 
the constraint qualification, strict complementarity and second order sufficiency 
conditions all hold at every local minimizer of (jl.ip if a finite set of polynomials, 
which are in the coefficients of polynomials /, hi (i € [mi]),gj (j g [ma]), do not 
not vanish at the input polynomials. (That is, they hold in a Zariski open set in 
the space of input polynomials.) These polynomials are listed as follows. 
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Condition 4.3. The polynomials fo G K.[a;]d 0} hi G K^ld; (i G [ m i]A o,nd 9j G 
K-Hd'. (j G [ TO 2]j tfii/i mi < n satisfy (Res, A are from $2.4, & from f^.6| ) and 
from'fFTIV): 

(a) If mi + m,2 > n + 1, for all 1 < ji < ■ ■ ■ < j n - mi +i < m 2 , 

Res(hi, . . . ,h mi , g jl7 . . . , g jn _ mi+1 ) ^ 0. 

(b) For all 1 < ji < ■ ■ ■ < j r < m2 with < r < n — mi, 

A(hi,...,h mi ,g h ,...,g jr ) ^ 0. 

(c) For all 1 < ji < ■ ■ ■ < j r < to 2 wif/i < r < n — mi, 

^{f,Pi, ■ ■ ■ ,PklPk+i) ^ 0, 

where (pi,. . . ,p k ,Pk+i) is a re-ordering of (hi, h mi ,g n ,. . .,g 3r ). 

(d) For a/Z 1 < ji < • • • < j r < to 2 wii/i < r < n — mi, 

<3(f ,h x , . . . ,h mi ,g h , . . . ,g 3i ) ^ 0. 



First, we study the relationship between Condition 14.31 and properties of critical 
points. Let u G K be a critical point of (11.11) (i.e., (|1.3[> and (jl.4p are satisfied for 
some Ai, /ij, excluding the sign conditions [ij > 0). Let J(u) := {ji, . . . ,j r } be the 
index set of active inequality constraints. Denote 

mi 

L(x) := /(a;) - AA-(#) - ^ H9j(x), 
i=i jeJ(u) 

G(x):= [Vhi (x) ■■■ Vh mi (x) V 9jl (x) ■■■ Vg jr (x)] T , 

\VlL(x) G(x)^ 
G(x) 



H(x) :-- 



Proposition 4.4. Let u G K and Xi, fij satisfy il.3\) - [L^\ l (excluding the sign con- 
ditions fij >0), and L(x),G(x), H(x) be as above. Condition ^ -3\ has the following 
properties: 

i) Item (a) implies that at most n — mi of gj 's are active at every point of K. 

ii) Item (b) implies that the constraint qualification condition holds at every 
point of K . 

hi) Item (c) implies that \ ^ 0, fij ^ for all i G [mi] and j G J(u). 
iv) Item (d) implies that H(u) is nonsingular, i.e., detH(u) 7^ 0. 

Proof, i) If more than n — mi of gj 's vanish at a point u G K, say, g 3l , . . . , gj n _ m +1 , 
then there are n+1 polynomials vanishing at u, including hi, ... , h mi . This implies 
the resultant 

Res(hi, . . .,h mi ,g 3l ,. . .,gj n _ mx+1 ) = 0, 

which violates item (a) of Condition 14.31 So, the item i) is true. 

ii) By item (b) of Condition 14.31 A(hi, . . . , h mi ,g }1 ,. . . , gj r ) ^ 0. By the def- 
inition of A (cf. §2.4), the gradients of hi, ■ ■ ■ , h mi ,gj 1 , . . . ,gj r at u are linearly 
independent, i.e., the constraint qualification condition holds at u. 

iii) Suppose otherwise one of A;(z G [mi]) or pLj(j G J(u)) is zero, say, fij r = 0, 
then u is also a critical point of the optimization problem 

min f(x) s.t. hi(x) = (i G [mi]), gj(x) = (j G J(u)/{j r }). 
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Note that gj r (u) — 0. By definition of 2$ in (|4.6[) and Proposition 14. 11 we get 

M{f,hi,...,h mi ,g jl ,...,g jr _^gj r ) = 0, 
which contradicts item (c) of Condition 14.31 So, the item hi) must be true. 



iv) This is implied by definition of & in (|4.11j) and Proposition 14.2 



□ 



Second, we study the relationship between Condition 14.31 and the optimality 
conditions. This is summarized as follows. 



Proposition 4.5. // Condition \4-3\ holds, then the constraint qualification, strict 
complementarity and second order sufficiency conditions all hold at every local min- 
imizer of This is implied by the following properties: 

1) Item (a) of Condition \4.3\ implies that at most n — mi of gj 's are active at 
every local minimizer of 

2) Item (b) of Condition \4-3\ implies that the constraint qualification condition 
holds at every local minimizer of il.l)) . 

3) Items (b ) and (c) of Condition \4-3\ imply that the strict complementarity 
condition holds at every local minimizer of U.l\) . 

4) Items (b) and (d) of Condition \4-S\ imply that the second order sufficiency 
condition holds at every local minimizer of 

Proof. Let it be a local minimizer of 

1) and 2) are implied by i), ii) of Proposition 14.41 respectively. 

3) By item 2), the constraint qualification condition holds at u. So, there exist 

satisfying (|1.3[) - (|1.4p with all fij > 0. If j $ J{u), then gj(u) > and 
[ij + gj{u) > 0; if j € then fij ^ by item hi) of Proposition 14. 4\ and hence 

/ij > and fij + gj(u) > 0. This means that the strict complementarity condition 
holds at u. 

4) By item 2), the constraint qualification condition holds at u. So, (|1.3j) and 
(|1.4j) are satisfied. The second order sufficiency condition is then implied by item 
iv) of Proposition 14.41 and Lemma l4~6l in the below. □ 



Lemma 4.6. Let u be a local minimizer of \i,fij satisfy il.3\) - [L4\ ), and 

L{x), G(x), H (x) be as defined preceding Proposition \4-4\ If G(u) has full rank, 
then JL7| ) holds at u if and only if detH(u) ^ 0. 

Proof. First, assume (|1.7p holds. Then, for rj > big enough, 

V := V 2 x L{u) + r]G(u) T G{u) y 0. 

By the matrix equation 



In 





one can see that 



r]G(u) 



V 2 x L(u) G(uf 
G(u) 

V 2 x L(u)+ v G{u) T G(u 
G(u) 



In 

G{uf 








%i+r 



dct H(u) = det(F) • det (-G(m)1/^ 1 G(u) t ) ^ 0, 

because of the positive definitcness of V and nonsingularity of G(u). 

Second, assume det H(u) ^ 0. Suppose otherwise (|1.7p fails. Then there exists 
0^d£ G(u) ± such that v T \7 2 L(0)v < 0. Since G(u) has full rank, the constraint 
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qualification condition holds at u. So, the second order necessity condition (|1.6[) is 
satisfied at u. It implies that v is a minimizer of the problem 

mm z T (v 2 x L(u)^z s.t. G(u)z = 0. 

By the first order optimality condition for the above, there exists v such that 
W xx L(u)v — G{u) T v, which then implies 

\l x L(u) G{uT 
G{u) 

This contradicts det H{u) 7^ 0, because v 7^ 0. So, (|1.7|l must hold at u. □ 

We conclude this section with the proof of Theorem 11.21 

Proof of Theorem \1.2{ Let tpi, . . . , ipL be the finite set of polynomials given in Con- 
dition [O] Theorem II .21 is then implied by Proposition 14.51 □ 

5. Some discussions 

Our main conclusions are Theorems 11.11 and 11.21 Lasserre's hierarchy has finite 
convergence when the constraint qualification, strict complementarity and second 
order sufficiency conditions hold at every global minimizer, under the archimedean 
condition. These optimality conditions are all satisfied at every local minimizer if 
the vector of coefficients of input polynomials lies in a Zariski open set. This gives 
a connection between the classical nonlinear programming theory and Lasserre's 
hierarchy of semidefinite relaxations in polynomial optimization. These results give 
an interpretation for the phenomenon that Lasserre's hierarchy often has finite 
convergence in solving polynomial optimization problems. 

Under the assumptions that Condition 12.21 holds at every u g K and K is 
irreducible and bounded, Marshall [17] Corollary 4.5] proved that, for each d > 2, 
the set 

{/ € K.[a;](j : / satisfies BHC at each global minimizer on K} 

is open and dense in R^]^. This interesting result can also be implied by Theorems 
11.21 and 13.11 Indeed, they can imply the following stronger conclusions: 

• the boundary hessian condition is satisfied in a Zariski open set in the space 
of input data (not every open dense set is Zariski open, e.g., M. n \Z n ); 

• Condition 12.21 also holds in a Zariski open set; 

• for the case d = 1, the boundary hessian condition also holds in a Zariski 
open set; 

• the defining polynomials for K are also allowed to be generic; the set K is 
not required to be irreducible or bounded. 

We would like to remark that Putinar's Positivstellensatz (cf. Theorem l2.1|) also 
holds generically for polynomials that are nonnegative on K . Assume (h) + Q(g) is 
archimedean and the ideal (h) is real. Let Pd(K) be the cone of polynomials in M.[x]d 
that are nonnegative on K, and dPd(K) be the boundary of Pd(K). Theorems II. 2 [ 
12 .41 and [3TT1 imply that if p lies generically on dPd(K) then p G (h) + Q{g). In [IT] . 
Lasserre interpreted Putinar's Positivstellensatz as a generalized KKT condition 
for global optimality. Therefore, the classical KKT conditions for local optimal- 
ity and the generalized KKT condition (i.e., Putinar's Positivstellensatz, under 
archimedeanness) for global optimality, both hold generically. 



= 0. 



OPTIMALITY CONDITIONS AND CONVERGENCE OF LASSERRE'S HIERARCHY 17 



A theoretically interesting question is whether there is a uniform bound on the 
number of steps to achieve hnite convergence for Lasserre's hierarchy in the generic 
case. That is, whether there exists an integer N, which only depends on the degree 
of / and a set of defining polynomials for K, such that = f m i n for all generic 
/ of a given degree and k > Nl Unfortunately, such a bound N typically does 
not exist. This could be implied by a result of Scheiderer [25] on the non-existence 
of degree bounds for weighted SOS representations. For instance, when K is the 
3-dimensional unit ball, such a bound does not exist (cf. [23 Section 5]). 

The archimedean condition cannot be removed in Theorems 11.11 12.41 For in- 
stance, consider the unconstrained optimization 



The origin is the unique minimizer. The archimedean condition failed, because 
the feasible set is the entire space R™ and is not compact. The objective / is the 
sum of the Motzkin polynomial and the positive definite quadratic form x T x. The 
second order sufficiency condition hold at 0. However, for all scalar 7, / — 7 is not 
SOS. In this case, Lasserre's hierarchy does not converge. 

The archimedean condition is not generically satisfied. To see this fact, consider 
the simple case that mi = and m 2 = 1. For a given d, let gf(d) be the set of 
polynomials g G M.[x]d such that Q(g) is archimedean. The set &/(d) is not dense 
in Rf^d. For instance, when d = 2, both g/(d) and its complement R[x]d\^(d) 
have nonempty interior: 

• Let 61 = x T x - 1. Clearly, b x £ R[x] 2 \^(2). For all q G R[x] 2 with 
sufficiently small coefficients, we have b\ + q £ &/(2). 

• Let b 2 = 1 - x T x. Clearly, b 2 G £?(2). For all q G R[x] 2 with small 
sufficiently coefficients, we have b 2 + q G s/(2). 

There exist polynomial optimization problems that Lasserre's hierarchy fails to 
have finite convergence, e.g., minimizing the Motzkin polynomial over the unit ball. 
Such problems always exist when the feasible set has dimension three or higher, as 
shown by Scheiderer [28]. So, we are also interested in methods that have finite 
convergence for optimizing all polynomials over a given set K . The Jacobian SDP 
relaxation is a method that has this property (cf. 20 ). 

Theorems 1 1 . 1 1 does not tell how to check when finite convergence happens. This 
can be done by using flat truncation, which is a rank condition on the dual optimiz- 
ers of (|1.2[) . Flat truncation is a sufficient condition for Lasserre's hierarchy to have 
finite convergence. In the generic case, flat truncation is also a necessary condition 
for Lasserre's hierarchy to have finite convergence (cf. [21)). 

No matter Lasserre's hierarchy has finite convergence or not, if there are finitely 
many global minimizers and the archimedean condition holds, then the flat trun- 
cation condition is always asymptotically satisfied (cf. |211 §3]). So, in numerical 
experiments, we might also observe that Lasserre's hierarchy has finite convergence 
even if it does not have in exact mathematical computations. However, if there 
are infinitely many global minimizers, the flat truncation condition is typically not 
satisfied (cf. Laurent [HI §6.6]). For instance, consider the problem 



The objective / is shifted from the Motzkin polynomial (i.e., f{x\,x 2l x 3 + 1) is 
the Motzkin polynomial). It has infinitely many global minimizers and fmin = 0. 



2 2/2, 2 o2\, 6, 2, 2 1 2 

mm x 1 x 2 (x 1 + x 2 — 6x 3 ) + x% + x 1 + x 2 + x 3 . 
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Lasserre's hierarchy for this problem does not have finite convergence. This can 
be implied by the proof of Prop. 6.1 of Scheiderer [27], because (0,0, 1) is a zero 
/ lying in the interior of the feasible set and / is a nonnegative but non-SOS 
form in (xx, %2,xa — !)• The flat truncation condition is typically not satisfied 
for dual optimizers of (jl.2|) . When GloptiPoly 3 is applied to solve this problem 
numerically, the convergence did not occur for the orders A; = 3,4, ...,12. 

Acknowledgement The author was partially supported by NSF grants DMS- 
0757212 and DMS-0844775. He would like very much to thank Murray Marshall 
for communications on the boundary hessian condition. 

Appendix A. Non-identically Vanishing of 3% and $ 

Given any positive degrees do, d\, . . . , dk, dk+i, we show that the polynomial 
8£(pq, . . . ,pk; q) defined in (|4.6j) and the polynomial @(j>o,pi, ■ ■ ■ ,Pk) defined in 
(|4.11jl do not vanish identically in pi £ K[x]rf 4 (i = 0, . . . , k) and q £ R[x]d k+1 - 
Without loss of generality, we can assume all d\, . . . , d& > 1 because linear con- 
straints in (|4.2I) can be removed by eliminating variables. 

First, we prove that the polynomial & defined in (|4.6|) does not vanish identically 
in the space K[x]d x ••• x K^]^ x R[x]d fc+1 . We only consider the case k < 
n, because if k = n then pi{x) = ■ ■ ■ = p~k{x) = q(x) has no nonzero complex 
solution in the generic case. By Proposition 14.11 it is enough to show that the 
homogeneous polynomial system (|4.5[) does not have a complex solution x ^ for 
generic po,pi, . ■ ■ ,Pk, q- We prove this in two cases: 

• (xo 7^ 0) We can scale as xq = 1, and the system (I4.5[) is then reduced to 
(14.41) . When Po,Pi, ■ ■ ■ ,Pk are generic, the set JC(p) defined in (|4.3[) is finite 
(cf. [HI Prop. 2.1]). Thus, when q is also generic, (|4.4|) does not have a 
solution in C™. 

• (x = 0) The system (|4.5|) is then reduced to 

f rank [V x pg(x) V x p\{x) •■■ V B pg(x)] < k, 
[ j 1 p$(x) = -..= P Ux) = q h (x) = 0. 

(Here, f h denotes the homogeneous part of the highest degree for a poly- 
nomial /.) When pi, . . . ,pk are generic, we have A(pJ, . . . ^ 0. By 
definition of A (cf. §2.4), if p\{x) =■■■= p^(x) = and x ^ 0, then 

rank [V ^{x) ■ ■ ■ V*&{xj\ - k. 

So, if x satisfies (|A. 1|) , there must exist scalars c\ , . . . , Ck such that 

V x p(j(x) = Cl W x p'i(x) + ■■■+ c k V x pl{x). 

Since each p\ is a form, by Euler's formula for homogeneous polynomials 
(cf. [H §2]), we can get 

k k 

doPo(x) = x T \7 x p%(x) = ^ax T V x p\{x) = ^2cidkPk( x ) = °- 

i=l i—l 

This means that (|A.1I) implies 

p %(x)=p>£(x) = ---=pUx)=0, 
rank [V x pft{x) V»pf(a;) ••■ V x p%{x)] = k. 
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Any x satisfying the above must be zero if A(pg,p5\ . . . ^ 0. 
Combining the above two cases, we know the polynomial system (|4.5I) has no com- 
plex solution i^O when po,pi, ■ • ■ ,Pk, q are generic. 

Second, we show that the polynomial @(po,pi, • ■ • ,Pk) defined in (|4.11|) does not 
identically vanish in the space K[x]d x K[a;]<ii x • • • x K.[x]d fc . By Proposition 14. 2\ 
it is enough to prove that there exist p. L e Iji]^ (i — 0, . . . , k) such that (|4. 10[) has 
no complex solution (x, y) with x ^ 0, y ^ 0. We prove this in two cases. 

• (x 7^ 0) We scale as x = 1, and (|4.10l) is then reduced to (|4.9I) . Choose 
polynomials Pi as follows: 

pa := /o £ M[sGk+i, . . . , x„]d , pi := a^ 1 — 1, . . . ,f>jt := x d k k - 1. 

Clearly, on the variety V(p\, . . . ,p k ), the gradients V x pi, . . . , V x pk are lin- 
early independent, and so are 



v x pi 














(i = l,...,fe). 



Thus, (|4.9p is equivalent to (|4.8|) . If (x, A) is a critical pair, then Ai = 
••• = At =0 and D := diag^a^ 1- , . . . , dfca;^'" 1 ) is invertible. Denote 
xj := (xi,...,Xfc,) and a;// := (aifc+i, . . . , x n ). Note that (p := {p 0l ...,p k )) 











£> " 


H p (x,0) = 





















(In the above, the 0's denote zero matrices of proper dimensions.) The ma- 
trix Hp(x, 0) is nonsingular if and only if Vi£ /o is nonsingular. Therefore, 
(|4.8p has a solution if and only if there exists u E C"~ fe satisfying 

V,„/oM = 0, det Wl u h{u) = 0. 

However, the above is possible only if 

^ ^)=0. 

dx k+ i dx n 

So, if fo is generic, then Hp{x, 0) is nonsingular for all (x, A) satisfying (|4.2p 
corresponding to po,Pi, • • • ,f>fc- 

By continuity of roots of polynomials, H p (x, A) is nonsingular for every 
pair (x, A) satisfying (14.21) . if each pi is generic and close enough to pi. 
• (xo = 0) The polynomial system (|4.10[) is then reduced to 

f rankQ(a;,y) <2k, p\ {x) = • • • = pfc(x) = 0, 
1 (V^(x)) T y=--- = (V^(x)) T y = 0. 

In the above, Q{x, y) denotes the matrix 

.(vM)y ••• (vM)» v x rf ■■• v,^ • 

We show that if Po,Pi, ■ ■ ■ ,Pk & re generic, then (|A.2[) has no complex so- 
lution (x,y) with x 7^ 0,y ^ 0. When all are generic, for every a; 7^ 
satisfying 

p'Z(x)=---=p h k (x) = 0, 
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the gradients ^ x Pi, • ■ • , V x p£ are linearly independent. When rank Q(x, y) < 
2k, there exist scalars ci, . . . , such that 

k 

(A.3) V^O) - H CjV,i#(aO = 0. 

i=l 

By Euler's formula for homogeneous polynomials, the above implies 

k k 
d oP %(x) = x T V xP %{x) = \x T V xP %{x) = ^idiPiix) = 0. 

i=l i=l 

This means that if some x ^ satisfies (|A.2[) then the polynomial system 
p h (x)=p h 1 (x)=---=p h k (x)=0 

is singular. But this is impossible unless A( P q, P i, . . . ,pi) = 0. 
Combining the above two cases, we know that there exist polynomials pi G Klx]^ (i ~ 
0, . . . , k) such that there are no complex x ^ 0, y ^ satisfying (I4.10[) . This shows 
that 3$(pq, . . . ,Pk) does not identically vanish. 
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